Human-powered sorts and joins

Authors

  • Adam Marcus
  • Eugene Wu
  • David Karger
  • Samuel Madden
  • Robert Miller
Abstract

Crowdsourcing marketplaces like Amazon’s Mechanical Turk (MTurk) make it possible to task people with small jobs, such as labeling images or looking up phone numbers, via a programmatic interface. MTurk tasks for processing datasets with humans are currently designed with significant reimplementation of common workflows and ad-hoc selection of parameters such as price to pay per task. We describe how we have integrated crowds into a declarative workflow engine called Qurk to reduce the burden on workflow designers. In this paper, we focus on how to use humans to compare items for sorting and joining data, two of the most common operations in DBMSs. We describe our basic query interface and the user interface of the tasks we post to MTurk. We also propose a number of optimizations, including task batching, replacing pairwise comparisons with numerical ratings, and pre-filtering tables before joining them, which dramatically reduce the overall cost of running sorts and joins on the crowd. In an experiment joining two sets of images, we reduce the overall cost from $67 in a naive implementation to about $3, without substantially affecting accuracy or latency. In an end-to-end experiment, we reduced cost by a factor of 14.5.
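To give a rough sense of why the optimizations named in the abstract can lower cost, the following minimal sketch counts how many crowd tasks each strategy would need. It is not Qurk's implementation: the function names, the item counts, and the batch size are hypothetical values chosen for illustration, and the sketch assumes that cost scales with the number of posted tasks.

```python
from math import ceil


def pairwise_comparisons(n: int) -> int:
    """Unordered pairs among n items: n * (n - 1) / 2."""
    return n * (n - 1) // 2


def batched_tasks(units: int, batch_size: int) -> int:
    """Tasks needed when each task bundles up to batch_size units of work."""
    return ceil(units / batch_size)


def join_pairs(left: int, right: int) -> int:
    """Candidate pairs a naive cross-product join asks the crowd to check."""
    return left * right


# Hypothetical sizes, not taken from the paper.
n_items = 40
batch = 5

# Sorting: comparing every pair versus asking for one rating per item.
compare_units = pairwise_comparisons(n_items)
rating_units = n_items
print("comparison sort units:", compare_units)
print("rating sort units:", rating_units)

# Batching: several units of work share one posted task.
print("comparison tasks after batching:", batched_tasks(compare_units, batch))

# Joining: pre-filtering shrinks each table before the cross product.
print("join pairs, no filter:", join_pairs(30, 30))
print("join pairs, each table halved by a filter:", join_pairs(15, 15))
```

Under these assumptions, ratings grow linearly with the number of items while pairwise comparisons grow quadratically, and a filter that halves both join inputs cuts the candidate pairs to a quarter. Actual savings depend on pricing, worker accuracy, and how many assignments are requested per task, none of which the sketch models.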

Similar resources

Human-powered Sorts and Joins

This paper describes how crowds are integrated into a declarative workflow engine called Qurk to reduce the burden on workflow designers. The authors focus on how to use humans to compare items for two of the most common operations in DBMSs: sorting and joining data. They describe the basic query interface and the user interface of the tasks posted to Amazon’s Mechanical Turk. They also propose a ...

Human-powered Sorts and Joins

Crowdsourcing markets like Amazon’s Mechanical Turk (MTurk) make it possible to task people with small jobs, such as labeling images or looking up phone numbers, via a programmatic interface. MTurk tasks for processing datasets with humans are currently designed with significant reimplementation of common workflows and ad-hoc selection of parameters such as price to pay per task. We describe ho...

Critique of Human-powered Sorts & Joins

Marcus et al. (2011) describe using crowds on Amazon’s Mechanical Turk (MTurk) for database sorts and joins. Crowdsourcing activities for these types of queries are typically costly in order to ensure accuracy and reliability. The authors present an interface for optimizing these functions in order to determine the cost-effectiveness of various methods (simple join, naïve batching, and...

Optimization techniques for human computation-enabled data processing systems

Crowdsourced labor markets make it possible to recruit large numbers of people to complete small tasks that are difficult to automate on computers. These marketplaces are increasingly widely used, with projections of over $1 billion being transferred between crowd employers and crowd workers by the end of 2012. While crowdsourcing enables forms of computation that artificial intelligence has no...

Feasibility Study of Building a Human Powered Hydrofoil Vessel

In this paper, a feasibility study of building a Human Powered Hydrofoil (HPH) vessel is reported. Hydrofoil vessels are a well-known class of high-speed craft. In addition to high-speed operation, hydrofoils have reliable maneuvering capability, good stability and proper operation in waves. The human-powered vehicle is also an idea that is currently advancing. Different aspects of the design and...

Publication date: 2011